On the complexity of inducing categorical and quantitative association rules

نویسندگان

  • Fabrizio Angiulli
  • Giovambattista Ianni
  • Luigi Palopoli
چکیده

Inducing association rules is one of the central tasks in data mining applications. Quantitative association rules induced from databases describe rich and hidden relationships to be found within data that can prove useful for various application purposes (e.g., market basket analysis, customer profiling, and others). Although association rules are quite widely used in practice, a thorough analysis of the related computational complexity is missing. This paper intends to provide a contribution in this setting. To this end, we first formally define quantitative association rule mining problems, which include boolean association rules as a special case; we then analyze computational complexity of such problems. The general problem as well as some interesting special cases are considered.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Association Rules From Quantitative Data Using Data Booleanization - Imberman and Domanski

Finding association rules in data that is naturally binary has been well researched and documented. Finding association rules in numeric/categorical data has not been as easy. Many quantitative algorithms work directly on the numeric data limiting the complexity of the generated rules. In addition, as you create intervals from the numeric data the dimensionality of the problem increases signifi...

متن کامل

On the Complexity of Mining Association Rules

In this paper we describe our ongoing research towards establishing the complexity of mining association rules from relational databases. We consider both quantitative, categorical and boolean association rules and various forms of quality indexes, including confidence, support, gain, laplace. The presented results show that all these problems are, generally, computationally hard to solve, even...

متن کامل

A new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining

Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...

متن کامل

Mining association rules from qualitative and quantitative clustering

A comparison of mining association rules from clusters generated by qualitative clustering and clusters obtained by quantitative clustering is presented. Whereas in quantitative clustering only numerical data are included, numerical and categorical data are used in qualitative clustering for record conglomeration. The aim of this paper is to compare the performance of the two different kinds of...

متن کامل

Optimizing Membership Functions using Learning Automata for Fuzzy Association Rule Mining

The Transactions in web data often consist of quantitative data, suggesting that fuzzy set theory can be used to represent such data. The time spent by users on each web page is one type of web data, was regarded as a trapezoidal membership function (TMF) and can be used to evaluate user browsing behavior. The quality of mining fuzzy association rules depends on membership functions and since t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Theor. Comput. Sci.

دوره 314  شماره 

صفحات  -

تاریخ انتشار 2004